IN THE UNITED STATES DESIGNATED OFFICE (DO/US)

In re: Application of Houslay et al. 
Serial No.: to be assigned 
Filed: concurrently herewith 

For: POLYNUCLEOTIDE SEQUENCING METHOD AND KITS THEREFOR 

Date: August 25, 2000 

BOX PCT 

Commissioner for Patents 
Washington, DC 20231 

PRELIMINARY AMENDMENT 

Dear Sirs: 

Prior to the examination of the above application, please enter the 
following amendment. 

In the Claims : 

4. (Amended) The method according to [any preceding] claim 1
wherein the polynucleotide is cleaved with two or more of said restriction 
enzymes. 

5. (Amended) The method according to [any preceding] claim 1
wherein the recessed 3'-ends are filled in by employing a DNA polymerase 
and a mixture of deoxynucleotide triphosphates containing dATP, dCTP, 
dGTP and dTTP, so as to generate substantially blunt-ends. 



7. (Amended) The method according to [any preceding] claim 1
wherein the blunt-ended fragments possess a single adenine 3'-overhang
and the cloning of said fragments is facilitated using a cleaved vector
comprising single thymidine 5'-overhangs at the cleavage site.

8. (Amended) The method according to [any preceding] claim 1
wherein the pairing of the matching ends and ordering of the fragments into a
contiguous over-lapping arrangement is carried out by using a computer
program designed for such an application. 

9. (Amended) The method according to [any preceding] claim 1
wherein reading of said nucleotide sequence from said contiguous 
arrangement is carried out by or with the assistance of a computer. 

10. (Amended) The method according to [any one of claims 1 to 8] 
claim 1 for use in conducting restriction mapping of a polynucleotide. 



11. (Amended) A computer program for use with the method according 
to [any preceding] claim 1 wherein the computer program serves to pair the
matching ends of the sequenced fragments and order the fragments into a
contiguous overlapping arrangement.




13. (Amended) A semi-automated or fully automated sequencing
apparatus with a dedicated computer comprising the computer program
according to [either of claims 11 or] claim 12.

14. (Amended) A kit suitable for use with the method according to 
[any one of claims 1 to 10] claim 1 wherein the kit comprises at least one of 
said restriction enzymes and a DNA polymerase(s) for the filling-in and/or 
sequencing reactions.



18. (Amended) A kit according to [any one of claims 14 to 17] claim 17 
further comprising a computer program [according to either of claims 11 or
12] in machine readable form.

Claims 1-18 are presented for examination. The above claims have
been amended to better conform to U.S. practice. Applicants respectfully 
request substantive examination on the merits. 



Correspondence Address: 
USPTO Customer No.: 20792 
Myers Bigel Sibley & Sajovec 
Post Office Box 37428 
Raleigh, NC 27627 
Telephone (919) 854-1400 
Facsimile (919) 854-1401 



"Express Mail" mailing label number EL481 797464US 
Date of Deposit: August 25, 2000 

I hereby certify that this paper or fee is being deposited with the United States Postal Service "Express Mail Post
Office to Addressee" service under 37 CFR 1.10 on the date indicated above and is addressed to BOX PCT,



Lucille N. Gillette
Date of Signature: August 25, 2000 
POLYNUCLEOTIDE SEQUENCING METHOD AND KITS THEREFOR

The present invention relates to a method of 
sequencing a polynucleotide utilising restriction enzymes 
which cleave the polynucleotide at a site away from the 
restriction enzyme's recognition site and to kits for use 
with such a method. 

With the advent of genetic engineering it has become
possible to isolate polynucleotide fragments and to
determine their nucleotide sequence. Typically the
polynucleotide fragment of interest is first amplified in
order to generate enough sequencing template, prior to
determining its polynucleotide sequence. This may be
achieved, for example, using polymerase chain reaction (PCR)
techniques or polynucleotide cloning methodologies.

However, it is generally difficult to sequence large
polynucleotide (e.g. DNA) fragments (i.e. greater than about
500bp-1kbp), due to the limitations of sequencing
methodologies. It is often therefore desirable to cleave
large fragments into more manageable smaller fragments and
to sequence these smaller fragments. The sequences
determined can then be reassembled into a single
polynucleotide sequence.

One technique of obtaining smaller fragments is known
as shotgun cloning. Typically, a large DNA fragment is
completely digested, using a frequent cutting restriction
enzyme, such as Sau3AI, into much smaller fragments. A
vector, for example a plasmid, is digested with a rarer
cutting enzyme (e.g. BamHI), so that the vector is cut only
once and so as to give complementary ends to those
generated by the frequent cutting enzyme. The small Sau3AI
digested polynucleotide fragments are then cloned into the
vector to allow sequencing.

However, such a strategy is not attractive because the
ends of the DNA fragments produced by digestion are
identical and so it is not easy to reassemble them into the
order in which they occur in the large fragment without
resorting to some form of restriction mapping.
Additionally, it is possible to fail to identify colonies
containing vectors with very small inserts since such
colonies can appear blue using conventional blue/white
selection. Unless an accurate restriction map has been
determined, it is possible to fail to identify that such
small inserts of sequence are missing from the whole
sequence and consequently ascertain the polynucleotide
sequence of the larger fragment incorrectly.

Thus, it is generally necessary to perform further
sequencing experiments in order to confirm the restriction
sites and ensure that all fragments have been cloned and
sequenced. It will be appreciated that this process can be
very time consuming and expensive to perform.

An alternative is to carry out a partial digest, again 
using a restriction enzyme such as Sau3AI. The partial
digest is intended to generate a series of overlapping 
clones which can be sequenced and the matching sequences 
aligned so as to form a contiguous overlapping sequence. 






WO 99/43845 PCT/GB99/00539 


However, the conditions for carrying out the partial
digestion have to be carefully controlled in order to
prevent complete digestion, the control of which can be
difficult to achieve. Moreover, a significant amount of
overlapping sequence may be generated which may lead to
some sections of the DNA being unnecessarily sequenced,
which again wastes time and resources.

Another system for sequencing large fragments of DNA
is based on the procedure developed by Henikoff (Henikoff,
S. (1984) Gene 351), in which exonuclease III (ExoIII)
is used to specifically digest DNA from a 5' protruding or
blunt-end restriction site. The other end of the DNA is
protected from digestion by ExoIII by a 4-base 3' overhang
restriction site or by an alpha-phosphorothioate filled
end.

Typically ExoIII is added to a sample of linearised
vector containing insert DNA and digestion started.
Samples of the ExoIII digestion are removed at timed
intervals and added to tubes containing S1 nuclease, which
removes the remaining single-stranded tails. The ends are
blunt-ended and ligated to re-circularise the now
deletion-containing vectors.

The generation of ordered sets of deletions by this
method relies on the uniform digestion rate of ExoIII.
However, ExoIII will also digest from nicks in
double-stranded DNA. It is therefore important to minimise the
proportion of nicked molecules in the starting DNA, by
purifying the DNA using special techniques.






Moreover, the ExoIII process is generally only
suitable if the restriction enzyme sites which linearise
the vector are not present in the insert, the probability
of which decreases with increasing insert size.
Furthermore the ExoIII process only results in DNA which
decreases in size from one end, since the other end is not
digested. Thus, subsequent sequencing only generates new
sequence from one end.

There is thus the need for a more efficient and easier 
process which will allow large fragments of polynucleotides 
(e.g. DNA) to be sequenced. 

It is therefore among the objects of the present
invention to obviate and/or mitigate at least one of the 
above described disadvantages. 

The present invention provides a method of determining 
the nucleotide sequence of a polynucleotide, comprising the 
steps of : 

a) cleaving the polynucleotide with a restriction enzyme so
as to generate two or more fragments, wherein the
restriction enzyme cleaves the polynucleotide at a site
away from the restriction enzyme's recognition site so as
to generate a cleaved site possessing a recessed 3'-end and
a 5'-overhang of undefined sequence;

b) filling-in said recessed 3'-ends so as to form
substantially blunt-ended fragments;

c) cloning and sequencing said blunt-ended fragments;

d) pairing matching blunt-ends of said blunt-ended
fragments so as to allow said blunt-ended fragments to be
ordered in a contiguous over-lapping arrangement; and

e) reading said nucleotide sequence from said contiguous
arrangement.

It is to be understood that the substantially blunt- 
ended fragments referred to above include fragments with 
true or perfect blunt-ends (ie blunt-ends which do not 
possess any overhang) as well as fragments which possess 
ends with a single-base overhang. 



The polynucleotide to be sequenced is generally
isolated from the genome with which it is associated and
optionally amplified, for example by PCR or cloning into a
vector and amplifying the vector in a suitable host.
Typically the polynucleotide may be greater than 1kb in
length, for example greater than 10kb or greater than
50-100kb in length.

In theory the polynucleotide may be of any length.
The suitability of said polynucleotide for sequencing will
generally depend on the number and length of restriction
fragments which are generated by cleavage with the
restriction enzyme.

Although the restriction enzymes cleave
double-stranded DNA, the polynucleotide need not initially be
double-stranded DNA. The polynucleotide can for example be
single-stranded RNA which is converted to double-stranded
cDNA by use of reverse transcriptase and DNA polymerase as
is well known in the art.

The polynucleotide may be from any desired source. For
example, the polynucleotide may be obtained from bacteria,
plants, insects, viruses and animals.

The restriction enzymes suitable for use in the
present invention specifically generate 5'-overhangs of
undefined sequence. The restriction enzyme identifies a
constant defined recognition site and cleaves the DNA
within an adjacent undefined region which may consist of
any sequence. An example of such an enzyme is HgaI.

HgaI recognises the following recognition site, with
the recognition sequence shown underlined:

              ↓
5' GACGC NNNNN NNNNN NNNNN 3'
3' CTGCG NNNNN NNNNN NNNNN 5'
   -----            ↑

where N represents any nucleotide base (e.g. A, C, G or T)
and the arrows show the point of cleavage.

Thus, HgaI cleavage at this site generates two ends
which both possess recessed 3'-ends and 5'-overhangs of
undefined sequence, one of which is:

5' GACGCNNNNN 3'
3' CTGCGNNNNNNNNNN 5'

By convention recognition sequences are often
represented by one strand only, written from 5' to 3'. For
enzymes such as HgaI, which cleave away from their
recognition sequence, the sites of cleavage are indicated
by their position, or in parentheses. Thus, the
recognition sequence of HgaI is often represented as:

GACGC(N)5/10, or
GACGC (5/10)

which means that the enzyme recognises the sequence GACGC
and cleaves the DNA within an adjacent region of any
sequence, 5 bases away from the end "C" of the recognition
sequence on the same strand and 10 bases away on the other
strand.
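The cleavage and fill-in steps can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `find_cuts` and `digest_and_fill` are hypothetical, only recognition sites on the written (upper) strand are searched, and neighbouring sites are assumed to be far enough apart that their cut positions do not interleave. Cutting at GACGC(N)5/10 and filling in the recessed 3'-ends leaves both resulting fragments carrying the same 5-base overhang sequence.

```python
def find_cuts(seq, site="GACGC", top_offset=5, bottom_offset=10):
    """Return (top_cut, bottom_cut) positions for each forward-strand site.

    Positions are indices into the upper strand: the upper strand is cut
    top_offset bases after the site, the lower strand bottom_offset bases after.
    """
    cuts = []
    start = seq.find(site)
    while start != -1:
        end = start + len(site)
        if end + bottom_offset <= len(seq):
            cuts.append((end + top_offset, end + bottom_offset))
        start = seq.find(site, start + 1)
    return cuts


def digest_and_fill(seq, site="GACGC", top_offset=5, bottom_offset=10):
    """Cleave seq and fill in recessed 3'-ends, returning upper-strand fragments.

    After fill-in the left-hand fragment extends to the lower-strand cut and
    the right-hand fragment starts at the upper-strand cut, so the overhang
    bases appear at the end of one fragment and the start of the next.
    """
    fragments = []
    left = 0
    for top, bottom in find_cuts(seq, site, top_offset, bottom_offset):
        fragments.append(seq[left:bottom])
        left = top
    fragments.append(seq[left:])
    return fragments


if __name__ == "__main__":
    demo = "AAAA" + "GACGC" + "ACGTA" + "CCGGT" + "TTTTTT"
    for fragment in digest_and_fill(demo):
        print(fragment)
```

In the demo sequence the overhang is CCGGT, which ends the first fragment and begins the second.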

Examples of other restriction enzymes suitable for use
in the present invention and their recognition sites are as
follows:

Alw26I                      GTCTC(N)1/5
BbvI                        GCAGC(N)8/12
BsmAI                       GTCTC(N)1/5
BsmFI                       GTCCC(N)10/14
Bst71I                      GCAGC(N)8/12
FokI                        GGATG(N)9/13
SfaNI                       GCATC(N)5/9
Eam1104I/EarI/Ksp632I       CTCTTC(N)1/4
BbsI/Bbv16II/BpiI/BpuAI     GAAGAC(N)2/6
BsaI/Eco31I                 GGTCTC(N)1/5
BsmBI/Esp3I                 CGTCTC(N)1/5
BspMI                       ACCTGC(N)4/8
GdiII                       CGGCC(A/G)(N)1/5
SapI                        GCTCTTC(N)1/4

Any restriction enzyme which generates a 5'-overhang
of undefined sequence may be used in the present invention.
However, it is preferable that the overhang be 3 or more
bases in length in order to minimise the probability of a
chance overlap match, as will be explained in detail below.

Typically, the recessed 3'-ends are filled in by
employing a DNA polymerase and a mixture of deoxynucleotide
triphosphates (dNTPs), i.e. a mixture containing
dATP, dCTP, dGTP and dTTP, so as to generate substantially
blunt-ends. DNA polymerases possess the ability to add
nucleotides onto an available 3'-OH group of a
polynucleotide chain, but cannot add bases to the
5'-phosphate group.

The skilled addressee is aware that DNA polymerases
that have a "proofreading" function, such as DNA polymerase
I, Pfu and Tli, exhibit 3'-5' exonuclease activity and
produce greater than 95% blunt-ended fragments. However,
certain thermostable polymerases including Taq, Tfl and Tth
polymerase add a single nucleotide, preferentially adenine,
to the 3'-end, so as to form a blunt-end possessing a
single additional base overhang. However, the single
nucleotide overhang can be used to assist with the cloning
of the DNA, since perfectly blunt-ended fragments can be
more difficult to clone.

The substantially blunt-ended fragments (i.e. perfectly
blunt-ended fragments or blunt-ended fragments possessing
a single base overhang) are cloned into an appropriately
digested vector, such as a plasmid, phagemid or phage
cloning vector. Typically the blunt-ended fragments are
cloned into a so-called polycloning region of such vectors
which possesses a number of unique restriction enzyme
sites.

The polycloning region may for example be digested
with a restriction enzyme which generates blunt ends, such
as SmaI or HincII, or alternatively digested with any
restriction enzyme which generates a 5'-overhang, since
this may also be filled-in by a filling in reaction, to
allow cloning of the substantially blunt-ended fragments.

Blunt-ended fragments which possess a single adenine
overhang may be cloned into so-called "T-tailed vectors",
or "TA cloning vectors" such as the pGEM®-T vector systems
available from Promega, Southampton, UK, using techniques
previously described in the art (see for example Clark, J.
(1988) Nucleic Acids Research 9677 - 9686).

Once the blunt-ended fragments have been cloned their
nucleotide sequence may be determined using conventional
DNA sequencing methods well known in the art. In
particular, the sequence of the previously undefined
5'-overhang region of the cleavage site, which was blunt-ended
by the filling-in process, is determined.



Since a single cleavage reaction generates two
identically complementary 5'-overhangs, albeit of initially
undefined sequence, sequencing of individual clones helps
identify which fragment ends were generated by a particular
cleavage reaction. This is made possible due to the nature
of the restriction enzymes used which generate variable
5'-overhangs.

The chances of two 5'-overhangs, generated by separate
cleavage reactions at different points in the
polynucleotide sequence, being accidentally the same, is
calculated from 4 (which is the number of possible bases, i.e.
A, C, G or T) raised to the power of the length of the
5'-overhang. Thus for a restriction enzyme which generates a
3-base 5'-overhang of undefined sequence, the chances of
any two separate 5'-overhangs being the same is 1:4^3 or
1:64. For a restriction enzyme which generates a 5-base
5'-overhang, the chances of any two separate 5'-overhangs
being the same is 1:1024. Therefore, providing that
relatively few fragments are generated by a particular
restriction enzyme, in comparison with the probability of
a chance match between any two separate 5'-overhangs, it is
possible to pair matching ends with a high degree of
certainty that they were generated from the same cleavage
reaction at a given point in the polynucleotide sequence.
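The arithmetic above can be checked with a short Python sketch. The first function reproduces the 1:64 and 1:1024 figures given in the text. The second goes one step beyond the text and is offered only as an illustration: assuming every overhang base is independent and equally likely, it estimates the probability that at least two of several cleavage sites share an overhang by chance. Both function names are hypothetical.

```python
def chance_match_odds(overhang_len):
    """Return n such that two independent overhangs match with odds 1:n."""
    return 4 ** overhang_len


def prob_any_shared_overhang(num_sites, overhang_len):
    """Probability that at least two of num_sites overhangs coincide.

    Assumes uniformly random, independent overhang sequences (an
    idealisation; real sequences are not perfectly random).
    """
    n = 4 ** overhang_len
    p_all_distinct = 1.0
    for i in range(num_sites):
        p_all_distinct *= max(0.0, (n - i) / n)
    return 1.0 - p_all_distinct


if __name__ == "__main__":
    for length in (3, 5):
        print(length, "base overhang: odds 1 in", chance_match_odds(length))
    print(prob_any_shared_overhang(4, 5))
```

The second function makes the point of the paragraph concrete: the risk of an ambiguous pairing grows with the number of fragments, so longer overhangs tolerate more cleavage sites.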

In this manner it is possible to identify all matching
ends by their sequence. The matching ends can then be 
paired and the fragments ordered so as to allow a 
contiguous over-lapping arrangement of sequences to be 
generated, from which the nucleotide sequence of the 
polynucleotide may be determined. Typically, pairing of 
the matching ends and ordering of the fragments into a 
contiguous over-lapping arrangement may be carried out by 
using a computer program designed for such an application. 
Reading of said nucleotide sequence from said
contiguous arrangement may then also be carried out by or
with the assistance of a computer.
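A minimal sketch of such a pairing and ordering program is given below in Python. It is not the program of the invention, only an illustration under simplifying assumptions: all fragment sequences are given on the same strand and in the same orientation, the overhang length `k` is known, and no two cleavage sites share an overhang by chance. The names `order_fragments` and `read_sequence` are hypothetical.

```python
def order_fragments(fragments, k):
    """Order filled-in fragments by pairing matching k-base ends."""
    by_left_end = {}
    for frag in fragments:
        if frag[:k] in by_left_end:
            raise ValueError("ambiguous: two fragments share a left end")
        by_left_end[frag[:k]] = frag

    # The leftmost fragment is the one whose left end matches no right end.
    right_ends = {frag[-k:] for frag in fragments}
    starts = [frag for frag in fragments if frag[:k] not in right_ends]
    if len(starts) != 1:
        raise ValueError("cannot identify a unique leftmost fragment")

    ordered = [starts[0]]
    while len(ordered) < len(fragments):
        nxt = by_left_end.get(ordered[-1][-k:])
        if nxt is None or nxt in ordered:
            raise ValueError("fragments do not form a single linear chain")
        ordered.append(nxt)
    return ordered


def read_sequence(ordered, k):
    """Read the full sequence, counting each shared k-base end only once."""
    return ordered[0] + "".join(frag[k:] for frag in ordered[1:])


if __name__ == "__main__":
    frags = ["GGATC" + "AAAA", "AAAA" + "CCGGT", "CCGGT" + "TTTT" + "GGATC"]
    chain = order_fragments(frags, 5)
    print(read_sequence(chain, 5))
```

A practical program would also have to compare each end against the reverse complements of the other ends, since a cloned fragment may be sequenced in either orientation.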

It may be appreciated that the method described herein 
may be used in conjunction with manual, semi-automated or 
fully automated sequencing apparatus known in the art. 

In manual sequencing the scientist typically reads the
sequence off an autoradiograph taken from a gel, on which
radioactive or chemiluminescent DNA fragments have been
separated according to size by electrophoresis. Such
techniques are well known in the art and are described for
example in Sambrook, J et al (1989) Molecular Cloning: a
laboratory manual, Cold Spring Harbor Laboratory, Cold
Spring Harbor. The sequence is then conveniently
entered into a computer to facilitate observation and/or
manipulation of the sequence using appropriate computer
software. However, manual sequencing is being circumvented
by semi-automated or fully automated sequencing apparatus
which can not only determine the sequence of a particular
polynucleotide, but can input this information directly
into a computer comprising appropriate sequence handling
computer software.

It is therefore immediately evident that a computer
program designed for pairing of the matching ends and
ordering of the fragments into a contiguous over-lapping
arrangement may be provided which is suitable for use with
the method of the present invention when using manual,
semi-automated, and/or fully automated sequencing
apparatus. For example it may be possible to provide
suitable software for use in conjunction with a
semi-automated or fully automated sequencing apparatus such that
the fragments generated using the method of the present
invention may be sequenced using a single apparatus linked
to a computer comprising the computer software. The
sequences of the various fragments are determined using the
sequencing apparatus, and the software is able to pair the
matching ends and order the fragments into a contiguous
over-lapping arrangement. Thereafter the software is able
to determine the sequence of said polynucleotide from said
contiguous arrangement and provide the user with a single
nucleotide sequence corresponding to the original
polynucleotide.



Thus in a further aspect the present invention
provides a computer program for use with the method as
described herein, wherein the computer program serves to
pair the matching ends of the sequenced fragments and order
the fragments into a contiguous overlapping arrangement;
thereafter the computer program may read from the
contiguous overlapping arrangement and provide the user
with the nucleotide sequence of the original
polynucleotide. Such a computer program may be provided to
a user of the present invention on a computer readable
medium such as a floppy disk, CD-ROM or the like.
Alternatively semi-automated or fully automated sequencing
apparatus with a dedicated computer may be provided with
the computer program preloaded into the computer's memory.

In order to help better understand the process of
pairing the matching ends and ordering the sequences,
reference is made to Figure 1 which shows the process
schematically.

Part A of Figure 1 shows five fragments (1 to 5) which
were generated from a single polynucleotide fragment which
had been cleaved with a restriction enzyme as defined
above. The fragments have been blunt-ended by filling-in
as described, cloned and sequenced. The small regions of
sequence corresponding to the 5'-overhangs generated by the
restriction enzyme are shown as different symbols. To a
high degree of certainty only the ends generated by a
particular cleavage reaction will be the same. Thus, for
example, the right hand end of fragment 5 matches the left
hand end of fragment 3.

By pairing the matching ends of the fragments it is
possible to order the fragments in a contiguous overlapping
linear arrangement as represented in part B of Figure 1.
Once the fragments are ordered as shown in part B, the
nucleotide sequence of the original polynucleotide can be
easily determined (as shown in part C of Figure 1).

In the example as represented by Figure 1 only two
individual ends match with one another. When only a few
fragments are generated the likelihood of more than two
ends matching is remote. Indeed Table 1 shows the
estimated average length of DNA that would be expected
before identical restriction sites for each particular
restriction enzyme would be observed.

Where there are random overlaps, more than one
contiguous arrangement permutation is possible. However,
most permutations can be discounted immediately, for
example, permutations that produce a circular contiguous
arrangement for a DNA fragment that is linear.
Additionally the polynucleotide could be cleaved using a
different restriction enzyme or a partial digest performed
in order to assist in ordering the fragments.

The present process may also be used to conduct
restriction mapping of a polynucleotide. To achieve this,
it is not necessary to sequence the entire length of each
fragment; only the blunt-ends generated from the
restriction enzyme digestion and filling-in reaction need
be sequenced. It is then possible to order the fragments
as described above in order to generate a restriction map.
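As an illustration of the restriction mapping use, the Python sketch below orders fragments from their end sequences alone and converts fragment lengths into cut positions. The input format, the function name `restriction_map` and the idea of supplying fragment lengths separately (for example from gel sizing) are assumptions made for this sketch. Lengths are taken to be those of the filled-in fragments, so adjacent fragments overlap by the `k` overhang bases.

```python
def restriction_map(frags, k):
    """Order fragments by end tags and return (order, cut_positions).

    frags maps a fragment name to (left_tag, right_tag, length_bp).
    Each cut position is the offset at which a shared k-base overhang starts.
    """
    by_left = {left: name for name, (left, _, _) in frags.items()}
    right_tags = {right for _, right, _ in frags.values()}
    starts = [name for name, (left, _, _) in frags.items()
              if left not in right_tags]
    if len(starts) != 1:
        raise ValueError("cannot identify a unique leftmost fragment")

    order, cuts, position = [starts[0]], [], 0
    while True:
        _, right, length = frags[order[-1]]
        nxt = by_left.get(right)
        if nxt is None or nxt in order:
            break
        position += length - k  # neighbours share the k overhang bases
        cuts.append(position)
        order.append(nxt)
    if len(order) != len(frags):
        raise ValueError("fragments do not form a single linear chain")
    return order, cuts


if __name__ == "__main__":
    example = {
        "f3": ("GGATC", "CAAAA", 9),
        "f1": ("AAAAC", "CCGGT", 9),
        "f2": ("CCGGT", "GGATC", 14),
    }
    print(restriction_map(example, 5))
```

Only the end tags and lengths are used, which mirrors the statement above that the full fragment sequences are not required for mapping.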

In another aspect the present invention provides a kit
suitable for use in any of said processes according to the
present invention, the kit comprising at least one
restriction enzyme as defined herein together with a DNA
polymerase or polymerases for the filling-in and/or
sequencing reactions. Other components such as dNTPs, a
T-tailed vector, competent cells, sequencing reagents and the
like may also be included as appropriate. In addition a
computer program in a machine readable form such as a
computer disk or CD-ROM may be provided for pairing the
matching ends and ordering the fragments into a contiguous
overlapping arrangement and thereafter providing the
nucleotide sequence of the polynucleotide.

The present invention will be further described and
understood with reference to the following non-limiting
Examples section.



Examples Section
Materials & Methods
1. Restriction Enzyme Digests

All restriction enzyme digests were performed on pure
DNA using restriction enzymes supplied by Promega (Promega,
Southampton, UK) or New England Biolabs (New England
Biolabs, Hitchin, UK). Incubation conditions were 37°C for
a minimum of 1 hour using the appropriate buffer supplied
by Promega or New England Biolabs. Following digestion DNA
was run on an agarose gel and gel extracted using the QIAEX II
gel extraction kit (Qiagen, Crawley, UK).

2. Extraction of DNA from agarose gels with the QIAEX II
gel extraction kit

All DNA extracted from gels was purified using the
QIAEX II DNA gel extraction kit according to the
manufacturer's instructions. Briefly, three volumes of
'QX1' buffer and 10µl of QIAEX II DNA binding beads were
added to each gel plug. The plugs were dissolved by
warming to 50°C during which time the beads were kept
suspended by vortexing every 2 min. After 10 min the beads
were pelleted by a 20s centrifugation in a benchtop
centrifuge. The supernatant was removed and the pellet
washed in 500µl 'QX1' buffer, resuspended, and then
pelleted in the same manner as above. The pellet was then
washed, resuspended and pelleted similarly in an ethanolic
wash 'PE' buffer. The pellet was then allowed to dry for
10 min and then eluted in 20µl of water. This DNA was
typically contaminated with ethanol and so was subsequently
purified by ethanol precipitation.

3. Ethanol precipitation

To the volume of DNA to be ethanol precipitated, 0.1
volume 3M sodium acetate was added and 2 volumes of 100%
ethanol. The vial was mixed and incubated at -80°C for 30
minutes. The precipitated suspension was centrifuged at
11000rpm in a Jouan (MR1812) refrigerated centrifuge for 10
min to pellet the DNA. The supernatant was aspirated and
1ml of 70% ethanol added. The DNA was pelleted again by
centrifugation at 11000rpm in the refrigerated centrifuge
for 5 min, the supernatant aspirated and the pellet allowed
to air-dry for 5-10 minutes. The DNA was resuspended in
buffer and the purity of the DNA checked by UV absorption
at 260nm and 280nm, where A260/A280 = 1.8 for pure plasmid
DNA.
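The reagent volumes and the purity check in this step are simple proportions, sketched below in Python for convenience. The function names are hypothetical, and the sketch assumes that both the 0.1 volume of sodium acetate and the 2 volumes of ethanol are measured relative to the starting DNA volume, which is how the sentence above reads.

```python
def precipitation_volumes(dna_volume_ul):
    """Return (3M sodium acetate, 100% ethanol) volumes in microlitres.

    Assumes both additions are relative to the starting DNA volume:
    0.1 volume of sodium acetate and 2 volumes of ethanol.
    """
    return 0.1 * dna_volume_ul, 2.0 * dna_volume_ul


def purity_ratio(a260, a280):
    """Return the A260/A280 ratio used as the purity check."""
    if a280 <= 0:
        raise ValueError("A280 must be positive")
    return a260 / a280


if __name__ == "__main__":
    acetate, ethanol = precipitation_volumes(20.0)
    print(f"sodium acetate: {acetate:.1f} ul, ethanol: {ethanol:.1f} ul")
    print(f"A260/A280 = {purity_ratio(0.90, 0.50):.2f}")
```

For the 20µl eluate from the gel extraction step this gives 2µl of sodium acetate and 40µl of ethanol.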

4. Generation of plasmid DNA

Plasmid DNA was prepared using maxiprep and miniprep
kits (Promega, Southampton, UK). A brief protocol for a
Promega maxiprep kit is given below.

Promega maxi-prep

A culture was set up by stabbing a toothpick into
frozen glycerol stocks and adding it to 400ml of ampicillin
(50µg/ml) LB medium. The culture was incubated overnight
at 37°C in a rotating incubator at SOOrpm.

Preparation of cleared lysate

The culture was then poured equally into 250ml Beckman
centrifuge tubes and pelleted at 9500g for 10 mins at room
temperature in a JA-14 rotor. Each pellet was resuspended
in 7.5ml 'Resuspension solution' using a heat-sealed 5ml
pipette to manually disrupt the pellet. These suspensions
were combined. To the combined 15ml, 15ml 'Cell Lysis'
solution was added and mixed by inversion. Lysis was
allowed to complete (up to 20 min) and then 15ml of
'Neutralisation solution' was added and immediately mixed
by inversion.



The suspension was centrifuged at 14,000g for 15 min
at room temperature. The cleared supernatant was
transferred to a new container.

Plasmid DNA precipitation

0.6 volumes of isopropanol was now added and mixed by
inverting several times. The DNA was pelleted by
centrifugation at 14,000g for 15 mins at room temperature.
The supernatant was discarded and the DNA pellet
resuspended in 2ml TE.

Plasmid purification

One Maxicolumn was inserted into a vacuum manifold.
10ml of well-shaken pre-warmed 'DNA purification resin' was
added to the DNA/TE solution and then this slurry was added
to the Maxicolumn. A vacuum was applied to draw the slurry
through. The DNA/resin container was rinsed with 13ml of
'Column wash solution' and immediately added to the column
under vacuum. A final wash of 12ml of 'Column wash
solution' was then added to the column. The resin was
rinsed with 5ml of 80% isopropanol under vacuum.

The resin was dried by centrifuging the column in its
50ml conical tube in a bench-top clinical centrifuge at
2,500 rpm (1300 g) for 5 min. It was then transferred to
a new 50ml conical centrifuge tube. 1.5 ml pre-heated
water (65-70°C) was applied to the tube. After 1 minute
this water was centrifuged out of the column using the
conditions above.

DNA solution was stored at -20°C.



5. Cloning DNA

Phosphatase treatment of DNA

If appropriate (i.e. not necessary for TA-cloning
vectors) prior to ligation of an insert into a vector, the
plasmid DNA was treated with calf intestinal alkaline
phosphatase (CIAP) if the vector had been digested with a
single restriction enzyme. The CIAP removes the 5'
phosphate groups and thus prevents recircularisation of the
vector during ligation.

Reaction mix

The following was added to a microcentrifuge tube:

vector DNA
CIAP 10x buffer
CIAP
dH2O

This was mixed gently and incubated for 1 hour at 37°C.
CIAP was removed prior to ligation by phenol/chloroform
extraction.

Double stranded DNA ligation

Double stranded DNA with cohesive ends was ligated
into 100ng vector by adding 1 unit of T4 DNA ligase
(Promega) to 1:1 and 1:3 ratios of vector and insert DNA in
19.5µl 1x ligase buffer (10X T4 DNA ligase buffer is 30mM
Tris-HCl, pH7.8, 100mM MgCl2, 100mM DTT, 10mM ATP). This
reaction was incubated at 14°C overnight. The ligase buffer
was aliquoted to prevent degradation of ATP.

6. TA Cloning
Sample preparation

The DNA precipitate from an ethanol precipitation was
resuspended in a volume such that the ratio of
concentration of the average sized insert to vector would
be 3:1 in the ligation.
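One common way to work out how much insert gives a 3:1 ratio is sketched below in Python. It is an assumption of this sketch, not something stated above, that the ratio is a molar ratio of insert to vector; under that reading the required insert mass scales with the ratio of insert length to vector length. The function name and the fragment sizes in the example are hypothetical.

```python
def insert_mass_ng(vector_ng, vector_bp, insert_bp, ratio=3.0):
    """Mass of insert (ng) giving the requested insert:vector molar ratio.

    Moles of double-stranded DNA are proportional to mass / length, so
    insert_ng = vector_ng * (insert_bp / vector_bp) * ratio.
    """
    if vector_bp <= 0 or insert_bp <= 0:
        raise ValueError("lengths must be positive")
    return vector_ng * insert_bp / vector_bp * ratio


if __name__ == "__main__":
    # Illustrative sizes only: a 3900bp vector and a 390bp average insert.
    print(insert_mass_ng(100.0, 3900, 390))
```

With 100ng of vector and an insert one tenth the length of the vector, a 3:1 molar ratio corresponds to 30ng of insert.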
Ligation

TA cloning depends on a property which certain
polymerases possess in transferring single adenines onto
the 3' end of blunt-ended DNAs. Vectors carrying
complementary T overhangs can ligate with these DNAs very
efficiently because neither molecule can circularise, thus
promoting intermolecular reactions. TA cloning was
performed using the Original TA cloning® kit or the
Eukaryotic TA cloning® kit (both available from Invitrogen
BV, NV Leek, Holland) (Bidirectional) as required. The
ligation reaction is carried out essentially as above,
using the supplied precut vector containing the T overhang.

TA cloning: transformation

An aliquot of frozen competent cells (either INVαF' or
TOP10F' supplied by Invitrogen) was thawed slowly on ice.
2µl of ligation reaction and 2µl of 0.5M β-mercaptoethanol
was added to the tube, mixed with the pipette tip and
incubated on ice for 30 min. The cells were then heat
shocked for 30s at 42°C and incubated on ice for a further
2 min. 250µl of SOC broth was then added and the
transformed cells incubated at 37°C for 60 min with shaking
(225rpm). 100µl of the culture was plated on a 10cm agar
plate containing 50µg/ml ampicillin. Transformed colonies
were identifiable in the 'Original' TA cloning vector
(pCR2.1) using blue/white colour selection because of
insertion into the β-galactosidase gene. Colour selection
was not possible in the eukaryotic TA cloning expression
vector (pCR3.1). In either case white colonies were
picked, PCR screened to ensure an insert was present and
glycerol stocks made of positive colonies.

7. DNA sequencing with the ABI sequencer

Protocol for cycle sequencing 

Samples for sequencing taken from maxi-preps or mini- 
preps were mixed with the TaqDyeDeoxy Terminator (Applied 
Biosystems, Foster City, CA, USA) reaction premix. 

Reaction mix:

Reaction premix (contains buffer, polymerase,
dNTPs, ddNTPs, magnesium)                      8µl
ds DNA template                                400ng
Primer (for ds DNA)                            3.2pmol
H2O                                            to 20µl

Sequencing reactions so prepared were subjected to
thermal cycling using the following conditions:

Cycles (25)    Denaturation    96°C for 30s
               Annealing*      47°C for 15s
               Extension       60°C for 4min

* This segment temperature was variable according to the
primer used. The temperature shown was that used for the
T7 sequencing primer (taatacgactcactataggg) and the pCR2.1
Upstream primer (agctatgaccatgattacg).

Reaction products were concentrated by ethanol 
precipitation and the pellets sent to the Glasgow 
University Molecular Biology Support Unit for gel 
electrophoresis and sequence determination using an Applied 
Biosystems fluorescent sequencer model 393A. 
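
As a planning aid, the total hold time of the cycling programme given above can be worked out from the three segment durations and the cycle count. This is a minimal sketch: it counts programmed hold times only and ignores ramping between temperatures, which depends on the thermal cycler and is not stated in the text. The function name is hypothetical.

```python
def hold_time_minutes(cycles, denature_s, anneal_s, extend_s):
    """Total programmed hold time in minutes, excluding temperature ramps."""
    return cycles * (denature_s + anneal_s + extend_s) / 60


if __name__ == "__main__":
    # 25 cycles of 96°C for 30s, 47°C for 15s and 60°C for 4min, as listed above.
    print(hold_time_minutes(25, 30, 15, 4 * 60), "min")
```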



Sequence analysis 

Routine DNA sequence handling and analysis was
performed on the Gene Jockey II program (Biosoft,
Cambridge, UK).



Generation of HgaI digested fragments

A 2.4kb XhoII fragment which had been cloned into a
plasmid vector was re-excised using flanking EcoRI sites
and the resulting fragment was digested, as described
above, with an appropriate or excess amount (units) of HgaI
restriction endonuclease.

Digestion generated four separate fragments, two of
0.4kb, one of 0.7kb and one of 0.9kb. The fragments were
separated by gel electrophoresis and purified from
appropriate gel fragments, as described above.

Example 2

Cloning the HgaI digested fragments

The recessed 3'-ends generated by HgaI and XhoII
digestion of the purified fragments were filled in using
Taq polymerase and dNTPs as outlined below.

Filling-in reaction as follows:

Gel purified DNA (dissolved in water)          to final volume 50µl
Taq polymerase 10X reaction buffer (Promega)   5µl
dNTPs (2mM of each dNTP)                       5µl
25mM MgCl2                                     3µl
Taq polymerase (Promega)                       2.5 units

Incubated at 65°C for 10 min.
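
The final concentrations in the 50µl filling-in reaction follow from the stock concentrations and volumes listed above by simple dilution (C1 x V1 = C2 x V2). The sketch below is illustrative only. The helper name is hypothetical, and the magnesium figure counts only the MgCl2 added separately, since the composition of the 10X reaction buffer is not given in the text.

```python
def final_conc(stock_conc, stock_volume_ul, total_volume_ul):
    """Concentration after diluting `stock_volume_ul` of stock into `total_volume_ul`."""
    if total_volume_ul <= 0:
        raise ValueError("total volume must be positive")
    return stock_conc * stock_volume_ul / total_volume_ul


TOTAL_UL = 50.0
dntp_each_mM = final_conc(2.0, 5.0, TOTAL_UL)    # each dNTP, from the 2mM stock
added_mgcl2_mM = final_conc(25.0, 3.0, TOTAL_UL)  # from the 25mM MgCl2 stock only
buffer_x = final_conc(10.0, 5.0, TOTAL_UL)        # 10X buffer diluted to working strength

if __name__ == "__main__":
    print(dntp_each_mM, "mM each dNTP")
    print(added_mgcl2_mM, "mM added MgCl2")
    print(buffer_x, "X buffer")
```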

A single TA ligation reaction was set up with the 
filled-in fragments and "original" TA vector as described 
above. 

After ligation was completed INVαF' competent cells
were transformed and plated on agar plates, containing
IPTG/Xgal for blue/white colour selection, and colonies
allowed to develop.

Plasmid DNA was prepared from a selection of colonies 
and screened using PCR techniques in order to ensure insert 
was present in the vector. 

Insert-containing plasmid DNA was then prepared for
DNA sequencing.

Sequencing the cloned fragments and ordering the sequences
into a contiguous arrangement

Sequencing of the plasmid DNA was carried out as
described above and the sequence obtained subjected to
sequence analysis.

Figure 2 shows the short unique overlaps which were
introduced into the polynucleotide fragment following
digestion with HgaI. The solid underlines show the HgaI
recognition sequences and the bold GATC motifs at the ends
of fragments 1 and 4 are the ends of the XhoII fragment
following TA cloning (recognition sequence RGATCY where R
= G/A and Y = C/T).
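
How digestion with HgaI followed by filling-in leaves a short shared sequence on neighbouring fragments can be illustrated in code. The following is a minimal sketch and not part of the described protocol. It assumes the HgaI cut positions GACGC(N)5/10 given in Table 3, treats the DNA as a single top-strand string, ignores the extra adenine added during TA cloning, and assumes that sites are far enough apart that their overhangs do not interleave. All function names are hypothetical.

```python
import re

SITE = "GACGC"      # HgaI recognition sequence read on the top strand
SITE_RC = "GCGTC"   # the same site when it lies on the bottom strand
OVERHANG = 5        # GACGC(N)5/10 leaves a 5-base 5'-overhang


def overhang_spans(seq):
    """Top-strand (start, end) spans of every HgaI overhang in seq."""
    spans = []
    for m in re.finditer(SITE, seq):
        start = m.end() + 5              # top strand is cut 5 nt past the site
        spans.append((start, start + OVERHANG))
    for m in re.finditer(SITE_RC, seq):
        end = m.start() - 5              # mirrored cut for a bottom-strand site
        spans.append((end - OVERHANG, end))
    # Discard cuts that would fall outside the molecule.
    return sorted(s for s in spans if s[0] >= 0 and s[1] <= len(seq))


def digest_and_fill(seq):
    """Blunt-ended fragments after digestion and filling-in of the recessed 3'-ends.

    Filling-in copies the overhang onto both fragments, so each fragment runs
    from the start of one overhang to the end of the next.
    """
    spans = overhang_spans(seq)
    starts = [0] + [s for s, _ in spans]
    ends = [e for _, e in spans] + [len(seq)]
    return [seq[s:e] for s, e in zip(starts, ends)]


if __name__ == "__main__":
    for fragment in digest_and_fill("AAAAAGACGCTTTTTCAGTCAAAAA"):
        print(fragment)
```

In this toy sequence the five bases lying 5 to 10 nt past the recognition site end the first fragment and begin the second, which is the kind of overlap shown in Figure 2.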

The entire sequence of the fragments is not shown
since it is not necessary for the understanding of the
underlying principle of the present invention. These short
unique overlaps allowed the two 0.4kb, one 0.7kb and one
0.9kb fragments to be ordered into a contiguous overlapping
arrangement as shown in Figure 3a. Figure 3b shows the
HgaI restriction map of the original polynucleotide
fragment.
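
The ordering step described above, in which fragments are chained by their matching ends and the sequence is then read off, can be sketched as follows. This is an illustrative sketch under stated assumptions, not the program referred to in the claims: every fragment sequence is assumed to be given on the same strand (in practice a clone may be read in either orientation, so the reverse complement would also have to be tried), the overlap length k is fixed at 5 bases as for HgaI, and each overlap is assumed to be unique. The function names are hypothetical.

```python
def order_fragments(fragments, k=5):
    """Order fragments so that each one's last k bases equal the next one's first k."""
    by_prefix = {}
    for frag in fragments:
        by_prefix.setdefault(frag[:k], []).append(frag)
    suffixes = {frag[-k:] for frag in fragments}

    # The first fragment is the one whose leading bases match no fragment end.
    first = [frag for frag in fragments if frag[:k] not in suffixes]
    if len(first) != 1:
        raise ValueError("cannot identify a unique first fragment")

    ordered = [first[0]]
    while len(ordered) < len(fragments):
        tail = ordered[-1][-k:]
        candidates = by_prefix.get(tail, [])
        if len(candidates) != 1:
            raise ValueError("no unique fragment starts with " + tail)
        ordered.append(candidates[0])
    return ordered


def read_contig(ordered, k=5):
    """Merge ordered fragments, writing each k-base overlap only once."""
    return ordered[0] + "".join(frag[k:] for frag in ordered[1:])


if __name__ == "__main__":
    # Toy fragments in arbitrary order, sharing 5-base ends.
    clones = ["CAGTCAAAAATTGCA", "GGGGGACGTACAGTC", "TTGCACCCCC"]
    ordered = order_fragments(clones)
    print(ordered)
    print(read_contig(ordered))
```

Raising an error when an overlap is missing or ambiguous, rather than guessing, reflects the observation in the text that some junctions can only be resolved from particular clones.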

As can be seen from Figure 2 and Figure 3b, the
junction between fragments 3 and 4 was more complex than
the other junctions, due to two HgaI restriction sites
being extremely close to each other. However, in such a
situation, digestion at one site effectively destroys the
other such that an overlap can still be discerned from some
clones.

TABLE 3



Restriction               Recognition       Frequency     Probability of    Length of
enzyme                    sequence          of cutting    chance overlap    DNA match 2
                                            (bp)          match 1

Alw26I                    GTCTC(N)1/5       1 in 512      1 in 256          131kb
BbvI                      GCAGC(N)8/12
BsmAI                     GTCTC(N)1/5
BsmFI                     GTCCC(N)10/14
Bst71I                    GCAGC(N)8/12
FokI                      GGATG(N)9/13
SfaNI                     GCATC(N)5/9

HgaI                      GACGC(N)5/10      1 in 512      1 in 1024         524kb

Eam1104I/EarI/Ksp632I     CTCTTC(N)1/4      1 in 2048     1 in 64           131kb

BbsI/Bbv16II/BpiI/BpuAI   GAAGAC(N)2/6      1 in 2048     1 in 256          524kb
BsaI/Eco31I               GGTCTC(N)1/5
BsmBI/Esp3I               CGTCTC(N)1/5
BspMI                     ACCTGC(N)4/8
GdiII                     CGGCCR(N)1/5

SapI                      GCTCTTC(N)1/4     1 in 8192     1 in 64           524kb

1 Probability of chance match between two unrelated
overhangs.

2 Estimated length of DNA before a random match between
unrelated recognition sequences. (Example calculation:
Alw26I will cut on average once every 512bp. Each
overhang has a 1 in 256 chance of matching. Thus the
estimated length of DNA before two identical
recognition sequences are observed is:
the frequency of cutting x the chances of matching,
i.e. 512bp x 256 = 131kb.)
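
The example calculation in footnote 2 can be repeated for the other rows of Table 3. The sketch below rests on an interpretation that reproduces the tabulated values but is not spelled out in the text: a non-palindromic recognition site of n bases is taken to occur once every 4^n / 2 bp because it can lie on either strand, and an overhang of m bases is taken to match an unrelated overhang by chance 1 time in 4^m. The function name is hypothetical, and degenerate sites such as that of GdiII are not modelled.

```python
def type_iis_stats(site_len, overhang_len):
    """Return (bp between cuts, 1-in-N chance of overhang match, expected bp before a match)."""
    cut_every_bp = 4 ** site_len // 2      # site may occur on either strand
    match_one_in = 4 ** overhang_len       # unrelated overhangs agree at every base
    return cut_every_bp, match_one_in, cut_every_bp * match_one_in


if __name__ == "__main__":
    rows = {
        "Alw26I": (5, 4),   # GTCTC(N)1/5
        "HgaI": (5, 5),     # GACGC(N)5/10
        "EarI": (6, 3),     # CTCTTC(N)1/4
        "BsaI": (6, 4),     # GGTCTC(N)1/5
        "SapI": (7, 3),     # GCTCTTC(N)1/4
    }
    for enzyme, (site_len, overhang_len) in rows.items():
        cut, match, length = type_iis_stats(site_len, overhang_len)
        print(enzyme, "1 in", cut, "| 1 in", match, "|", round(length / 1000), "kb")
```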

CLAIMS



1. A method of determining the nucleotide sequence of a
polynucleotide, comprising the steps of:

a) cleaving the polynucleotide with a restriction
enzyme so as to generate two or more fragments,
wherein the restriction enzyme cleaves the
polynucleotide at a site away from the restriction
enzyme's recognition site so as to generate a cleaved
site possessing a recessed 3'-end and a 5'-overhang
of undefined sequence;

b) filling-in said recessed 3'-ends so as to form
substantially blunt-ended fragments;

c) cloning and sequencing said blunt-ended fragments;

d) pairing matching blunt-ends of said blunt-ended
fragments so as to allow said blunt-ended fragments to
be ordered in a contiguous over-lapping arrangement;
and

e) reading said nucleotide sequence from said
contiguous arrangement.

2. The method according to claim 1 wherein the
restriction enzyme generates a 5'-overhang of 3 or
more bases in length.

3. The method according to claim 2 wherein the
restriction enzyme is selected from HgaI, Alw26I,
BbvI, BsmAI, BsmFI, Bst71I, FokI, SfaNI, Eam1104I,
EarI, Ksp632I, BbsI, Bbv16II, BpiI, BpuAI, BsaI,
Eco31I, BsmBI, Esp3I, BspMI, GdiII and/or SapI.



4. The method according to any preceding claim wherein
the polynucleotide is cleaved with two or more of said
restriction enzymes.

5. The method according to any preceding claim wherein
the recessed 3'-ends are filled in by employing a DNA
polymerase and a mixture of deoxynucleotide
triphosphates containing dATP, dCTP, dGTP and dTTP, so
as to generate substantially blunt-ends.

6. The method according to claim 5 wherein the DNA
polymerase is DNA polymerase I, Pfu polymerase, Tli
polymerase, Taq polymerase, Tfl polymerase or Tth
polymerase.

7. The method according to any preceding claim wherein
the blunt-ended fragments possess a single adenine
3'-overhang and the cloning of said fragments is
facilitated using a cleaved vector comprising single
thymidine 5'-overhangs at the cleavage site.

8. The method according to any preceding claim wherein
the pairing of the matching ends and ordering of the
fragments into a contiguous over-lapping arrangement
is carried out by using a computer program designed
for such an application.

9. The method according to any preceding claim wherein
reading of said nucleotide sequence from said
contiguous arrangement is carried out by or with the
assistance of a computer.

10. The method according to any one of claims 1 to 8 for
use in conducting restriction mapping of a
polynucleotide.

11. A computer program for use with the method according
to any preceding claim wherein the computer program
serves to pair the matching ends of the sequenced
fragments and order the fragments into a contiguous
overlapping arrangement.

12. The computer program according to claim 11 which
further reads from the contiguous overlapping
arrangement and provides the user with the nucleotide
sequence of the polynucleotide.

13. A semi-automated or fully automated sequencing
apparatus with a dedicated computer comprising the
computer program according to either of claims 11 or
12.



14. A kit suitable for use with the method according to
any one of claims 1 to 10 wherein the kit comprises at
least one of said restriction enzymes and a DNA
polymerase(s) for the filling-in and/or sequencing
reactions.

15. A kit according to claim 14 further comprising a
vector for cloning the substantially blunt-ended
fragments.

16. A kit according to claim 15 wherein the vector is a
cleaved vector comprising single thymidine
5'-overhangs at the cleavage site.

17. A kit according to claim 16 wherein the vector is a
pGEM®-T vector or a TA Cloning® Vector.


18. A kit according to any one of claims 14 to 17 further
comprising a computer program according to either of
claims 11 or 12 in machine readable form.

OVERLAP OF FRAGMENTS 1 AND 2
GATCTGAAQGGTTTT..0.4kb..TCCCTGGTGTTTGTG
                                  |||||
                                  TTGTGGGGACGCGIQ

OVERLAP OF FRAGMENTS 2 AND 3
..0.4kb..GACGCCTTCGGCTTC
                   |||||
                   GCTTCTCCGCAGCTG



OVERLAP OF FRAGMENTS 3 AND 4 
..CTCTCCGGGTCTCTC 
0.9kb....GGTCTCTCTCTCTGC 

I II I I 

TC TGCGTC C TGCGTC ..0.7kb..CCCTCCGGGCAGATC 

Fig. 2 

CONTIG OF OVERLAPPING CLONES
[drawing not reproduced; surviving labels: 0.4kb, 0.7kb, 2.4kb]

Fig. 3a

HgaI RESTRICTION MAP
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are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 
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POWER OF ATTORNEY: As a named inventor, I hereby appoint the practitioners 
associated with the Customer Number provided below to prosecute this application and to 
transact all business in the Patent and Trademark Office connected therewith, and direct that 
all correspondence be addressed to that Customer Number: 

Customer Number 20792 



Send correspondence to:          Kenneth D. Sibley
                                 Myers Bigel Sibley & Sajovec
                                 Post Office Box 37428
                                 Raleigh, NC 27627

Direct telephone calls to:       Kenneth D. Sibley
                                 (919) 854-1400
Facsimile:                       (919) 854-1401

Full name of (first/sole) inventor:  Miles Douglas Houslay

Inventor's Signature:

Date:

Residence:                       Torrey Pines, Prieston Road,
                                 Bridge of Weir PA11 3AJ
                                 United Kingdom
Citizenship:                     British
Post Office Address:             Torrey Pines, Prieston Road,
                                 Bridge of Weir PA11 3AJ
                                 United Kingdom

Full name of second inventor:    Neil Graham Rena

Inventor's Signature:

Residence:                       1/L 12 Forfar Road
                                 Dundee DD4 7AR
                                 United Kingdom
Citizenship:                     British
Post Office Address:             1/L 12 Forfar Road
                                 Dundee DD4 7AR
                                 United Kingdom




